Problem 1

The bar chart and the pie chart below show the distribution of pre-existing medical conditions of children involved in a study on the optimal duration of antibiotic use in the treatment of tracheitis, which is an upper respiratory infection.

  1. What features are apparent in the bar chart but not in the pie chart?
  2. What features are apparent in the pie chart but not in the bar chart?
  3. Which graph would you prefer to use for displaying these categorical data?

Problem 2

  1. Describe the shape of the distribution of velocities. Specifically address the number of modes, symmetry/skew, and any unusual features present.
  2. What features are apparent in the histogram but not in the boxplot?
  3. Which graph would you prefer to use for displaying these data? Justify your answer.
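
As a rough illustration of how the two displays summarize data differently, the sketch below computes the ingredients of each: the five-number summary that a boxplot draws and the bin counts that a histogram draws. The values are made up for illustration and are not the velocity data from this problem. The function names `five_number_summary` and `bin_counts`, the bin width of 5, and the "inclusive" quartile method are all assumptions of this sketch.

```python
from statistics import quantiles

# Hypothetical data with two well-separated clusters (NOT the problem's data).
values = [9, 10, 10, 11, 11, 11, 12, 12, 13,
          29, 30, 30, 31, 31, 31, 32, 32, 33]


def five_number_summary(data):
    """Return (min, Q1, median, Q3, max), the values a boxplot is built from."""
    q1, median, q3 = quantiles(data, n=4, method="inclusive")
    return (min(data), q1, median, q3, max(data))


def bin_counts(data, start, width, nbins):
    """Count observations in nbins left-closed bins [start + k*width, start + (k+1)*width)."""
    counts = [0] * nbins
    for x in data:
        k = int((x - start) // width)
        if 0 <= k < nbins:
            counts[k] += 1
    return counts


print(five_number_summary(values))   # (9, 11.0, 21.0, 31.0, 33)
print(bin_counts(values, 5, 5, 6))   # [1, 8, 0, 0, 1, 8]
```

In this made-up example the median of 21 falls in a bin that contains no observations. Comparing the two printed summaries may help when deciding which features each graph can and cannot show.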

Problem 3

  1. Describe the shape of the distribution of velocities. Specifically address the number of modes, symmetry/skew, and any unusual features present.
  2. Below is a density plot displaying the same data. What features are easier to see in the density plot? What features are easier to see in the histogram?

  3. Below is a side-by-side (i.e., faceted) histogram of movie lengths based on whether the film is classified as a “short” film. Have films been classified as a “short” in a consistent way? Explain briefly.

Problem 4
